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Abstract 



> 

\^ • Intermediate-scale (or 'meso-scale') structures in networks have received considerable at- 

\ tention, as the algorithmic detection of such structures makes it possible to discover network 

■ features that are not apparent either at the local scale of nodes and edges or at the global scale 

, of summary statistics. Numerous types of meso-scale structures can occur in networks, but 

04 ■ investigations of meso-scale network features have focused predominantly on the identification 

and study of community structure. In this paper, we develop a new method to investigate the 
meso-scale feature known as core-periphery structure, which consists of an identification of a 
network's nodes into a densely connected core and a sparsely connected periphery. In contrast 
to traditional network communities, the nodes in a core are also reasonably well-connected to 
I those in the periphery. Our new method of computing core-periphery structure can identify 

multiple cores in a network and takes different possible cores into account, thereby enabling 
a detailed description of core-periphery structure. We illustrate the differences between our 
method and existing methods for identifying which nodes belong to a core, and we use it to 
classify the most important nodes using examples of friendship, collaboration, transportation, 
and voting networks. 

Submitted for publication. 

1 Introduction 

Networks are used to model systems in which entities, represented by nodes, interact with each 
other. When representing a network as a graph, all of the connections are pairwise and hence 
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represented by ties known as edges [I112]- Such a representation has led to numerous insights in 
the social, natural, and information sciences, and the study of networks has in turn borrowed ideas 
from all of these areas [3]. 

Networks can be described using a mixture of local, global, and intermediate-scale (meso-scale) 
perspectives. Accordingly, one of the key uses of network theory is the identification of summary 
statistics for large networks in order to develop a framework to analyze and compare complex 
structures [1]. In such efforts, the algorithmic identification of meso-scale network structures allows 
one to discover features that might not be apparent either at the local level of nodes and edges or 
at the global level of summary statistics. 

In particular, considerable effort has gone into algorithmic identification and investigation of 
a particular type of meso-scale structure known as community structure [HE]; in which cohesive 
groups called "communities" consist of nodes that are connected densely to each other such that 
the connections between nodes in different communities are comparatively sparse. Myriad methods 
have been developed to detect network communities [IHZ! , including ones that allow such groups to 
overlap with each other [SHlOj. These efforts have met with some success, as they have led to insights 
in applications such as committee [11] and voting [12] networks in political science, friendship 
networks at universities [13] and other schools [H], protein-protein interaction networks [15], and 
mobile phone networks |16j . 

Although (and arguably because) community structure has been very successful [HIS], the inves- 
tigation of other types of meso-scale structure — often in the form of different "block models" [5] — 
has received much less attention than it deserves. The type of meso-scale network structure that 
we consider in the present paper is known as core-periphery structure and was first proposed by 
Borgatti and Everett in 1999 [17] . There have been a few attempts to advance this methodol- 
ogy |17H19] , but they typically rely on implementations of the original method |17j in the software 
package UCInet [20], and there has been little other development since. 

A core-periphery structure of a network is a division of the nodes into a densely connected 
core and a sparsely connected periphery. The nodes in the core should also be reasonably well- 
connected to the nodes in the periphery, but nodes in the periphery are not well-connected to the 
core. Hence, a node belongs to the core if and only if it is well-connected both to other nodes in 
the core and to nodes assigned to the periphery. A core structure in a network is thus not merely 
densely connected but also tends to be "central" to the network in terms of short paths through 
the network. The latter feature also helps to distinguish core-periphery structure from community 
structure. Importantly, networks can have nested core-periphery structure as well as both core- 
periphery structure and community structure, so it is desirable to develop algorithms that allow 
one to simultaneously examine both types of meso-scale structures. 

Several results underscore the importance of considering core-periphery structure in addition to 
community structure. For example. Fan Chung and Linyuan Lu [21] showed that power-law random 
graphs, in which the number of nodes of degree k is proportional to , almost surely contain a 
dense subgraph that has short distance to almost all other nodes in the graph when the exponent 
f3 € (2,3). This suggests that it is sensible for networks with heavy-tailed degree distributions to 
contain some sort of cohesive core, and there is strong evidence that this is indeed the case in many 
real- world networks (such as many social networks and the World Wide Web) [T|[2t[22]. 

Additionally, nodes of particularly high degree, called 'hubs', occur in many real-world networks 
and can pose a problem for community detection, as they often are connected to nodes in many 
parts of a network and can thus have strong ties to several different communities. For instance, such 
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nodes might be assigned to different communities by the use of different computational heuristics 
apphed to the same notion of community [23], and it becomes crucial to consider their strengths 
of membership across different communities (e.g., by using a method that allows overlapping com- 
munities) [9lll0j. This suggests, moreover, that in such situations the usual notion of a community 
might not be ideal for achieving an optimal understanding of the meso-scale network structure that 
is actually present and that considering hubs to be part of a core in a core-periphery structure 
might be more appropriate |24j . 

The rest of this paper is organized as follows. We first describe previously proposed methods 
for detecting core-periphery structure in networks before presenting our new method, called Core 
Score, which computes a continuous value along a core-periphery spectrum and thereby yields a 
centrality measure based on core-periphery structures in the network. We illustrate our method 
using a set of synthetic (computer-generated) benchmark random networks with a known core. We 
then apply our method to several real-world networks: the Zachary Karate Club, collaboration 
networks of network scientists, and United States Congressional voting networks. We conclude by 
summarizing our results and include additional results — including an application to the London 
Underground ('The Tube') transportation network — in an online Supplementary Information. 



2 Detecting Core-Periphery Structure 

2.1 Existing Methods 

Intuitively, one expects many real- world networks to possess some sort of core-periphery structure 
as part of their meso-scale structure. Methods proposed to measure the core-periphery structure 
of a network include block models [17], /c-core organization [18], and aggregation of information 
about connectivity and short paths through the network |19j . 

The notion of a core-periphery structure was formalized in social networks by Borgatti and 
Everett in 1999 [17], who proposed algorithms for detecting both discrete and continuous versions 
of core-periphery structure in weighted, undirected graphs. Their discrete notion of core-periphery 
structure is based on comparing a network to an ideal block model that consists of a fully-connected 
core and a periphery that has no internal edges but is fully connected to the core. Their algorithm 
aims to find a vector C of length n whose entries can either be 1 or 0. The ith entry Cj equals 1 if 
the corresponding node is assigned to the core, and it is if the corresponding node is assigned to 
the periphery. Let Cij = 1 if Cj = 1 or Cj = 1 and Cij = otherwise, and define 

pc = Y,^ifiv^ (1) 

where Aij are the elements of the graph's adjacency matrix. The matrix element Aij represents 
the strength (i.e., weight) of the tie between nodes i and j, and it equals if nodes i and j are 
not connected. The Borgatti-Everett algorithm seeks a value of pc that is high compared to the 
expected value of /3 if C is shuffled such that the number of 1 and entries are preserved but their 
order is randomized. The output is the vector C that gives the highest z-score for pc- 

As a variation of the above discrete notion of core-periphery structure, Borgatti and Everett 
defined 



1 , if Cj and Cj = 1 



a G [0, 1] , if Ci = 1 or Cj = 1 , (2) 
, otherwise. 
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They also defined a continuous notion of core-periphery structure in which a node is assigned a 
'coreness' value of Ci and Cij = Ci x Cj = a. Our method to study core-periphery structure in 
weighted, undirected networks is motivated by the continuous formulation of Borgatti and Everett. 
Before discussing our approach, we call attention to two other methods in the literature. 
In 2005, Holme defined a core-periphery coefficient [18] 



Cc (ycoreiG)) / Cc (Vbore(G")) 



''^'^''^ - Cc{V{G)) - \cWW)r I G'^g^G) ' 

where the angled brackets indicate the average and Q{G) is an ensemble of graphs with the same 
degree sequence as the unweighted, undirected graph G and 

Cc(?7) = (/(P(.,i)),,^..,.\. y\ (4) 



and P{i,j) is the distance (i.e., number of edges in the shortest path) between nodes i and j. A 
/c-core of the graph G is a maximal connected subgraph in which all nodes have degree at least k, 
and Vcore is the fc-core with maximal Cg{U). Using A;-cores to examine core-periphery structure 
is computationally fast (and we note that one could, in principle, generalize Holme's method for 
weighted graphs), but it entails very strong restrictions on the notion of a network core. 

One expects the core of a network to have high connectivity to other parts of the network, so 
Da Silva et al. introduced a measure of connectivity known as network capacity [19j : 

M 

K = Y.pr\ (5) 

1=1 

where M is the total number of connected pairs of nodes and Pi is the length of the shortest path 
between the /th pair of nodes. Da Silva et al. then defined a core coefficient as cc = N' /N, where N 
is the total number of nodes in the network, N' satisfies X^^=o ~ ^-^ '^u=o ^^'^ 
capacity of the network after the removal of m nodes. The nodes are removed in order of closeness 
centrality, which is defined as the mean shortest path from a node to each of the other nodes in 
a network [19]. Note that in the remainder of this paper, we will use the following definition for 
closeness centrality of a node j (there are several different ones available in the literature): 



N 

where P{i,j) is the sum of edge weights in a shortest path in the context of weighted networks. Da 
Silva et al. considered only binary networks, but their method can be generalized straightforwardly 
to weighted networks. 

2.2 Our Method 

Our method to study core-periphery structure in weighted, undirected networks is motivated by 
the continuous formulation of Borgatti and Everett described above. 

We allow the vector C to take non- negative values, and we let Cij = Ci x Cj. We seek a core 
vector C that is normalized (so that the entries sum to 1) and is a shuffle of the vector C* whose 
components specify the local core values 

n* — \ (a\ 

l + exp{-(m-7V/3) X tan(7ra/2)} ' ^' 
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such that the core quality 



R — ^ ^ AijCiCj 



(7) 



is maximized. There are two parameters: a € [0,1] and (3 € [0,1]. The parameter a sets the 
sharpness of the boundary between the core and the periphery. The value a = yields the fuzziest 
boundary and a = 1 gives the sharpest transition (i.e., a binary transition): as a varies from to 
1, the maximum slope of C* varies from to oo. The parameter /3 sets the size of the core: as /3 
varies from to 1, the number of nodes included in the core varies from N to 0. We define the 
total core score of each node i as 



where T(i) is the set of neighbours of node i and the normahzation factor Z is chosen so that 
ma,Xk[CS{k)] = 1. We note, additionally, that core score gives a new notion of network centrality 



In the results presented in this paper, we assign the values of C*(a,/3) to maximize R using 
a simulated-annealing algorithm [25] (see the online Supplementary Information for details of the 
procedure). Other algorithms may be faster, especially those in which the choices for a and /? 
depend on the structure of the network. The purpose of this paper, however, is to introduce 
a novel notion of core-periphery structure, so we leave the question of which algorithm is most 
efficient to future work. 

3 Synthetic Benchmark Networks 

We test our method on an ensemble of random networks with an artificial core-periphery structure. 
In designing these synthetic networks, we are motivated by the fact that networks can have both 
community structure and core-periphery structure. In Fig. [H we show images of the adjacency 
matrices of different idealized block models, which illustrate (a) community structure, (b) core- 
periphery structure, (c) a global core-periphery structure with a local community structure, and 
(d) a global community structure with a local core-periphery structure. By permuting rows and 
columns of the adjacency matrix, one can see that (c) and (d) are, in fact, equivalent. 



Figure 1: Different network block models, (a) Community structure, (b) core-periphery structure, 
(c) global core-periphery structure with local community structure, and (d) global community 
structure with local core-periphery structure. Note that (c) and (d) are equivalent. 
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3.1 Artificial Core-Periphery Structure 

We develop a family of synthetic networks that only have a core-periphery structure [see Fig. [T]|^b)], 
and we call these networks CP{N,d,p,k). (We will consider networks with both core-periphery 
structure and community structure when we examine real- world networks.) Each network in the 
ensemble CP{N,d,p,k) has N nodes, where dN of the nodes are core nodes, (1 — d)N of the 
nodes are periphery nodes, and d € [0, 1]. The edges are assigned independently at random. The 
edge probabilities for periphery-periphery, core-periphery, and core-core pairs are p, kp, and k'^p, 
respectively. Note that p G [0, 1] and k e [1, We fix iV = 100, d = ^, and p = j and 

compute the core-periphery structure for 100 different instances of this random graph ensemble for 
each of the parameter values A; = 1,1. 1,1. 2,. ..,2. We show our results in Fig. [2j These synthetic 
networks have a discrete core-periphery structure, whereas our algorithm outputs a continuous 
ranking, which we recall makes core score a notion of centrality. 

In Fig. [21 we display the fraction of nodes that are correctly identified as one of the top 50 
core nodes (because the networks have Nd = 50 core nodes by construction). We compare our 
results to those obtained using the continuous core-detection algorithm in UCINet using the so- 
called "minres" algorithm in Version 6.289 |20] . (Whenever two nodes had an identical score, the 
tie was broken at random.) We also show the results of attempting to determine the core nodes 
using various types of centrality, which are designed to measure notions of importance of nodes. 

In Core Score, which examines core-periphery structure as a new kind of centrality, nodes are 
more likely to be part of a network's core both if they have high strength and if they are connected 
to other nodes in the core. The latter idea is reminiscent of ideas like eigenvector centrality and 
PageRank centrality, which recursively define nodes as important based on having connections 
to other nodes that are important [l]. We will also compare Core Score with various notions of 
centrality below when we discuss our example of political voting networks. 
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Figure 2: Fraction of core nodes correctly identified by computing core score for random networks 
CP(100, .5, .25, A;) averaged over 100 realizations. We compare this fraction to those obtained 
using the continuous core-detection algorithm in UCINet (Version 6.289) and various other types 
of centralities. 
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4 Real Networks 



In this section, we examine core-periphery structures in several real-world data sets. We consider 
another example in the online Supplementary Information. 

4.1 The Zachary Karate Club 

We first consider the infamous Zachary Karate Club |26| , which consists of friendship ties between 
34 members of a university karate club in the U.S. in the 1970s (we used the unweighted version for 
this purpose). A conflict led the club to split into two new clubs, and this network has become one 
of the standard benchmark examples for investigations of community structure [11[5]. We visualize 
the network in Fig. [3l where we have identified the nodes according to the split that occurred as a 
result of a longstanding disagreement between the instructor, Mr. Hi, and the club president, John 
A0. These two primary actors are represented by nodes 1 and 34, respectively. 
Table [1] shows the nodes with their global core values, which we computed using Core Score. 
Unsurprisingly, the main actors (nodes 1 and 34) have the highest mean core scores. One can see 
more structure by considering all values of the parameters a and /? rather than averaging over 
them. In particular, the fact that node 1 has the highest mean core value does not imply that it 
has the highest value of Cj*(a, /?) for all a and /?. In Fig. HI we show how the top node varies with 
respect to a and /3. Node 1 has the highest core value only 16% of the time, whereas node 34 is 
the top node 44% of the time. 



Figure 3: The Zachary Karate Club network [26], visualized using the implementation of the 
Kamada-Kawai algorithm [27] of Ref. [28] . The colors represent the two groups into which the club 
split while it was under study. 



One can illustrate this result by plotting the core quality i? ([7]) as a function of a and /3 (see 
Fig. [5]). The regions in which node 1 has the highest Core Score typically correspond to those in 

^ These names are pseudonyms introduced in Ref. [26] 
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Table 1: Nodes in the Zachary Karate Club network nodes along with their global core values. 



Node 


Core Score 


Node 


Core Score 


1 


1.0000 


6 


.0681 


34 


.8876 


26 


.0641 


3 


.8487 


25 


.0631 


2 


.7720 


11 


.0455 


33 


.7126 


5 


.0448 


4 


.5927 


22 


.0347 


14 


.4740 


18 


.0345 


9 


.4658 


15 


.0296 


Q 
O 






.U290 


31 


.3192 


23 


.0287 


24 


.2763 


16 


.0284 


32 


.2507 


19 


.0278 


30 


.2149 


10 


.0275 


20 


.1569 


13 


.0265 


28 


.1248 


27 


.0162 


29 


.1128 


17 


.0135 


7 


.0702 


12 


.0027 




Figure 4: The Zachary Karate Club network's top Core Score node as a function of a and (5. 

which the aggregate core score (i.e., the core quality R) is the highest in Fig.[5j One can distinguish 
one band for each of the top three nodes (node 34 in green, node 3 in red, and node 1 in blue). The 
areas in which node 1 has the top Core Score are those in which the core size is smallest. Although 
node 34 has a slightly higher degree, node 1 has the densest small core of blue nodes around it. 
As the core size becomes larger, more of the red nodes are added to the core and node 3, which 
has more connections to red nodes, becomes more important to the network than node 1. As the 
core becomes very large, node 34 ends up at the top simply because it has the highest degree. The 
landscape of top core nodes can be very complicated, especially as one considers larger networks, 
but it provides a good illustration of how Core Score works in the case of the Zachary Karate Club. 

4.2 The London Underground 

We compute core values for the London Underground ('Tube') transportation network, which ex- 
hibits a strong core-periphery structure but a weak community structure. We collected the data 
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Figure 5: The Zachary Karate Club core quality {R) score as a function of the parameters a and 
/3. 



for this example using the website for the London Underground ( |http : //www . tf 1 . gov . uk ) . The 



Tube network that we assembled has 317 nodes (one for each station) and weighted edges that 
represent the number of direct, contiguous connections between two stations. For example, Baker 
Street and Paddington share an edge of weight 2, as they are adjacent stations on the Circle Line 
and the Hammersmith &: City Line. They are also connected by the Bakerloo Line; however, they 
are not adjacent stations on this line, so it does not affect the weight of the edge between them. 

Ccomunity detection (as detected by optimizing modularity at the default resolution value using 
the Louvain [29] algorithm) splits this network into 21 communities, with the largest community of 
size 19. Most of these communities are formed by groups of stations on a single line. Hence, these 
communities do not appear to give much information about this network. 

Table [2] shows the result of applying the Core Score method to the London Tube network. We 
list the top ten stations and their corresponding Core Scores. 



Node Core Score 

King's Cross St. Pancras 1.0000 

Baker Street 0.8339 

Waterloo 0.8175 

Willesden Junction 0.7973 

Bank 0.7789 

West Ham 0.7516 

Green Park 0.7447 

Oxford Circus 0.7200 

Liverpool Street 0.7167 

Paddington 0.6799 



Table 2: The ten most core- like nodes in the London Underground network along with their Core 
Scores. 
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4.3 Networks of Network Scientists 



We now consider networks of co-authorships between scientists working on network science. We 
study two such networks — one from 2006 ^0] and another from 2010 [31] . These networks, which 
both concentrate on papers written by physicists, have 379 and 552 nodes, respectively, in their 
largest connected components. The nodes correspond to scholars working in the field of network 
science, and an edge between two of them has a weight based on the number of papers that they 
have co-authored. (Note that the 2006 network is not a subset of the 2010 network.) We show 
the names of the scientists with the top thirty core scores for both networks in Table 2 in the 
Supplementary Information. 




Figure 6: Visualization of the 2010 network of network scientists. Each pie chart represents a 
community and the colors represent the Core Score rank order of its members. Each pie seg- 
ment represents a single node, and larger pies contain more nodes. We located the centers of the 
communities using the Kamada-Kawaii algorithm |27y28|. using the code described in Ref. 



The networks of network scientists give a pair of examples that have both sensible community 
structure and sensible core-periphery structure [recall the block model in Fig. [U^c) and (d)]. We 
illustrate this point in our visualization of the network in Fig. [6l Each pie chart represents a 
community, as detected by optimizing modularity using the Louvain [29] algorithm. Each pie is 
composed of the nodes in a single community, and each node is colored according to its Core Score. 
One can plainly see that the network's core is distributed throughout the various communities, and 
that each community has both core and periphery nodes. 

The selected community detection implementation splits the 2006 network into 19 communities 
and the 2010 network into 25 communities, although different community detection heuristics 
and random seeds will yield somewhat different sets of groups [23]. For example, one previous 
examination [32] of the community structure of the 2006 network of network scientists identified 
and described a variety of partitions, (the highest modularity there coming by way of a divisive 
spectral routine which initially separates the connected component into three large groups: one in 
which A.-L. Barabasi is the key node (in the sense of having the largest community centrality in 
the group), one in which M. E. J. Newman is the key node, and one in which A. Vespignani and 
R. Pastor-Satorras are the two key nodes. As shown in Table 2 in the Supplementary Information, 
all four of these nodes have very high Core Scores. 
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The communities in both the 2006 and 2010 networks exhibit a core-periphery structure. Each 
of the communities in the two networks of network scientists are composed of one or two highly 
connected nodes and several other nodes with low strengths. In the 2006 network, the mean 
strength is 4.8, and 17 of the 19 communities contain a node with strength at least 9. (There are 43 
such nodes in the entire network.) In the 2010 network, the mean strength is 4.7, and 20 of the 25 
communities contain a node of at least strength 10. (There are 50 such nodes in the entire network.) 
Additionally, as indicated above, the core nodes are distributed throughout the communities. In 
the 2006 network, 12 of the 19 communities contain at least one node among those with the top 30 
Core Scores; in the 2010 network, 9 of the 25 communities contain at least one node in the top 30. 

4.4 U.S. Congress Voting Network 

Finally, we consider United States Congress roll-call similarity networks. One can build such a 
network from a single 2-year Congress of either the Senate or the House of Representatives |33H35] . 
For each House and Senate, one constructs a complete weighted network in which a node represents 
a legislator and the edge between two legislators gives the fraction of bills on which they both voted 
in the same manner. That is, the adjacency matrix element Aij is equal to the number of times 
that legislator i and j voted in the same way divided by the total number of bills on which both 
i and j cast a vote. This type of network is called a "similarity network", because the weights of 
the edges represent a measure of similarity between the nodes to which they are incident. 

We look at the similarity network of the 108*'* Senate, which occurred during the third and 
fourth year of George W. Bush's presidency (2003-2005). We give the individual Core Score for 
each Senator in Table 3 of the Supplementary Information. In Fig. [71 we show scatter plots between 
the strength centrality and different other centrality measures for the 108*'* Senate network. We 
color Republicans in red and Democrats in blue. Notice how community centrality shows a negative 
correlation with strength centrality [30]. Figure [2] confirms that this is sensible: in our synthetic 
networks, community centrality gives a higher rating to periphery nodes than to core nodes when 
the two types of nodes have similar densities. 

Some of the centrality measures in Fig. [7| have been used previously to study Senators and 
Representatives in legislation cosponsorship networks |37[l38j. which have in turn been compared 
to modularity-based measures of political partisanship studied using roll-call voting networks |39] . 
As one can see from Fig. [71 these different centrality measures do indeed measure different things. 
Observe that, in this example, none of the centrality measures by themselves separate the commu- 
nities very well, whereas a combination of two of them often does. Investigation of core-periphery 
structure using Core Scores thus complements examination of community structure by allowing 
one to examine a different type of meso-scale structure in networks such as Congressional voting 
networks. It also nicely complements existing centrality measures. 

5 Conclusions and Discussion 

In conclusion, we have proposed a new method to investigate core-periphery structure in networks. 
This complements investigations of network community structure, which has been considered at 
great length and from myriad perspectives. By contrast, there are comparatively few methods to 
study core-periphery structure, which we believe is just as important as community structure. As 
we have illustrated, networks can contain community structure, core-periphery structure, or both. 
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Strength 



Strength 




Figure 7: Scatter plots between strength and various centrality measures for the 108*'' Senate 
network. The betweenness centrahty was obtained by turning the network into a binary network 
(threshold: .6) and using code from [36]. 
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For example, the 2006 and 2010 networks of network scientists clearly exhibit both types of meso- 
scale structures. In these networks the core-periphery structure reveals a global 'infrastructure' 
that remains invisible if one searches only for community structure. 

In contrast to the wealth of attention given to community structure over the last decade, the 
development of new methods to examine core-periphery structure is in its infancy. We have shown 
using a synthetic family of networks that our method performs well in detecting artificial core- 
periphery structures. The purpose of the present paper is conceptual development, and our current 
implementation of our method is very slow because of its use of simulated annealing. Moreover, 
we considered 10000 different values of (a, /3) in each use of the method, but this should not be 
necessary, so one can also speed up the method by intelligent choice of which parameter values to 
consider. 

Many networks contain meso-scale structures in addition to (or instead of) community structure, 
and continued pursuit of methods to investigate them should prove fruitful. As we have illustrated, 
core-periphery structure provides one example that is worth further attention. 
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Supplementary Information 



Simulated Annealing 

The simulated annealing Matlab code we used for the purpose of this paper was written by Joachim 
Vandekerckhove j4Uj . using the following parameters. Initial temperature: 1, stop temperature: 
10^^, cooling schedule: .8*T, maximum number of consecutive rejections: 1000, maximum number 
of tries within one temperature: 300, maximum number of successes within one temperature: 20. 

Network of Network Scientists 

Table [3] show the names and Core Scores of the top 30 nodes for both the 2006 and 2010 network 
of network scientists. 



NNS2006 Node 


Core Score 


NNS2010 Node 


Core Score 


RflrflViflt;i A -Tj 


1.00 


1 f (Xl (X P^ClOl ^ - \ • 1 J , 


1.00 


Tpotip' i~r 


0.92 


Tpon P" l~r 


0.88 


Oltvai Z N 


0.69 


Roppalptti ?1 

J-J'JVjVjfXldJ Li, . 


0.67 


Npwman MKT 


0.60 


Newman MKT 


0.65 


P 3,st or- S 3jt orr 3iS , R. . 


0.56 




0.62 


A/Tnrpnn \^ 

L\ IXJi.KLi.i.KJ ^ J. ■ 


0.43 


Virspk T 


0.51 


Vespignani, A. 


0.43 


Pastor-Satorras, R. 


0.49 


Vicsek, T. 


0.40 


Kahng, B 


0.49 


Boccaletti, S. 


0.40 


Oltvai, Z. N. 


0.49 


Vazquez, A. 


0.36 


Moreno, Y 


0.48 


Holme, P. 


0.34 


Arenas, A. 


0.46 


Kahng, B. 


0.33 


Vespignani, A. 


0.43 


Diaz-Guilera, A. 


0.33 


Diaz-Guilera, A. 


0.39 


Guimera, R. 


0.32 


Kurths, J. 


0.37 


Sole, R. 


0.32 


Kertesz, J. 


0.37 


Amaral, L. A. N. 


0.31 


Guimera, R. 


0.37 


Kurths, J. 


0.31 


Caldarelli, G. 


0.36 


Ravasz, E. 


0.31 


Amaral, L. A. N. 


0.35 


Barthelemy, M. 


0.29 


Crucitti, P. 


0.33 


Albert, R. 


0.29 


Vazquez, A. 


0.32 


Arenas, A. 


0.28 


Barthelemy, M. 


0.32 


Neda, Z. 


0.28 


Sole, R. V. 


0.30 


Kim, B. 


0.25 


Kleinberg, J. 


0.30 


Farkas, I. 


0.24 


Fortunato, S. 


0.30 


Latora, V. 


0.24 


Porter, M. A. 


0.28 


Schubert, A. 


0.23 


Farkas, I. J. 


0.27 


Boguna, M. 


0.23 


Ravasz, E. 


0.27 


Watts, D. J. 


0.21 


Tomkins, A. S. 


0.27 


Caldarelli, G. 


0.21 


Goh, K. I. 


0.26 


Strogatz, S. H. 


0.19 


Kaski, K. 


0.26 



Table 3: The 30 nodes with the top Core Scores from the 2006 (left) and 2010 (right) networks of 
network scientists. 
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Core Score and UCINet correlation plot for the 108 Senate 

In Figure [8] we have plotted the correlation scatter plot between the continuous core scores of the 
108*'' Senate for Core Score and the MINRES algorithm in UCINet respectively. The individual 
Core Score of each Senator can be found in Table [H Republicans were the majority party in the 
108*'^ Senate, so they tend to have higher Core Scores than Democrats. A notable exception is Zell 
Miller [D - GA], whose collaboration and common voting patterns with Republican Congressmen 
is well-known |1HI39|. The Democrat with the second highest Core Score is John Kerry [D - MA], 
the 2004 Democratic nominee for President. 



0.115 



« 0.11 



0.4 0.6 
CoreScore 



Figure 8: Correlation scatter plot between the continuous core scores of the 108 Senate, calculated 
with Core Score and the MINRES algorithm in UCINet respectively. 



17 



ode 


Core Score 


Party Vote 


Mitch McConncll [R - KYJ 


1.0000 


98% 


Thad Cochran [R - MS] 


0.9550 


98% 


Jim Running [R - KY] 


0.9296 


97% 


Conrad Burns [R - MT] 


0.8957 


96% 


Bill Frist [R - TN] 


0.8754 


97% 


Orrin Hatch [R - UT] 


0.8609 


97% 


Pete Domoniei [R - NM] 


0.8373 


96% 


Saxby Chambliss [R - OA] 


0.8070 


97% 


Pat Roberts [R - KS] 


0.7950 


97% 


Chuck Grassley [R - lA] 


0.7779 


97% 


Bob Bennett [R - UT] 


0.7608 


97% 


John Cornyn [R - TX] 


0.7188 


96% 


Larry Craig [R - ID] 


0.7166 


96% 


Mike Crapo [R - ID] 


0.7049 


96% 


Kit Bond [R - MO] 


0.6908 


96% 


Jim Talent [R - MO] 


0.6618 


97% 


Sam Brownback [R - KS] 


0.6158 


96% 


Ted Stevens [R - AK] 


0.6156 


96% 


Lamar Alexander [R - TN] 


0.5912 


97% 


Liddy Dole [R - NC] 


0.5795 


96% 


James Inhofe [R - OK] 


0.5793 


96% 


Craig Thomas [R - WY] 


0.5535 


95% 


George Allen [R - VA] 


0.5423 


96% 


Trent Lett [R. - MS] 


0.5153 


95% 


Wayne Allard [R - CO] 


0.5052 


95% 


Richard Shelby [R - AL] 


0.5028 


95% 


Richard Lugar [R - IN] 


0.4913 


96% 


Jeff Sessions [R - AL] 


0.4620 


94% 


Chuck Hagel [R - NE] 


0.4617 


95% 


Mike Enzi [R - WY] 


0.4544 


94% 


Zell Miller [D - GA] 


0.4437 


38% 


Lindsey Graham [R - SC] 


0.4263 


94% 


Rick Santorum [R - PA] 


0.4209 


94% 


Lisa Murkowski [R - AK] 


0.4092 


95% 


John Warner [R - VA] 


0.3997 


94% 


Donald Nickles [R - OK] 


0.3953 


93% 


John Kerry [D - MA] 


0.3942 


98% 


Norm Coleman [R - MN] 


0.3897 


94% 


Paul Sarbanes [D - MD] 


0.3885 


96% 


Gordon Smith [R - OR] 


0.3808 


94% 


Debbie Stabenow [D - MI] 


0.3721 


96% 


Barbara Mikulski [D - MD] 


0.3720 


96% 


Jon Kyi [R - AZ] 


0.3602 


93% 


Peter Fitzgerald [R - IL] 


0.3531 


93% 


Ben Campbell [R - CO] 


0.3439 


93% 


Kay Bailey Hutchison [R - TX] 


0.3418 


92% 


Carl Levin [D - MI] 


0.3409 


95% 


John Edwards [D - NC] 


0.3385 


96% 


George Voinovich [R - OH] 


0.3278 


92% 


John Sununu [R - NH] 


0.3255 


91% 


Mike DeWine [R - OH] 


0.3184 


91% 


Patty Murray [D - WA] 


0.3137 


95% 


John Ensign [R - NV] 


0.3103 


90% 


Arlen Specter [R - PA] 


0.2991 


85% 


Barbara Boxer [D - CA] 


0.2972 


95% 


Judd Gregg [R - NH] 


0.2953 


90% 


Dick Durbin [D - IL] 


0.2929 


95% 


Susan Collins [R - ME] 


0.2911 


84% 


Hillary Clinton [D - NY] 


0.2900 


95% 


John McCain [R - AZ] 


0.2844 


84% 


Jack Reed [D - RI] 


0.2840 


95% 


Olympia Snowe [R - ME] 


0.2810 


82% 


Lincoln Chafce [R - RI] 


0.2724 


78% 


Ben Nelson [D - NE] 


0.2676 


72% 


Edward Kennedy [D - MA] 


0.2651 


94% 


Tim Johnson [D - SD] 


0.2615 


94% 


John Breaux [D - LA] 


0.2611 


74% 


Chuck Schumer [D - NY] 


0.2584 


94% 


Max Baucus [D - MT] 


0.2566 


82% 


Mary Landrieu [D - LA] 


0.2527 


85% 


Blanche Lincoln [D - AR] 


0.2484 


87% 


Ron Wyden [D - OR] 


0.2475 


93% 


Jon Corzine [D - NJ] 


0.2473 


95% 


Tom Daschle [D - SD] 


0.2469 


94% 


Chris Dodd [D - CT] 


0.2450 


94% 


Mark Pryor [D - AR] 


0.2437 


89% 


Maria Cantwell [D - WA] 


0.2425 


95% 


Patrick Leahy [D - VT] 


0.2414 


94% 


Evan Bayh [D - IN] 


0.2396 


86% 


Harry Reid [D - NV] 


0.2394 


93% 


Byron Dorgan [D - ND] 


0.2378 


91% 


Herb Kohl [D - WI] 


0.2376 


94% 


Kent Conrad [D - ND] 


0.2310 


88% 


Tom Carper [D - DE] 


0.2285 


86% 


Bill Nelson [D - FL] 


0.2245 


93% 


Jay Rockefeller [D - WV] 


0.2150 


93% 


Dianne Feinstein [D - CA] 


0.2115 


92% 


Jefl Bingaman [D - NM] 


0.2091 


92% 


Mark Dayton [D - MN] 


0.2009 


93% 


Daniel Akaka [D - HI] 


0.1940 


94% 


Frank Lautenberg [D - NJ] 


0.1933 


94% 


Joseph Lieberman [D - CT]].8 


0.1688 


93% 


Daniel Inouye [D - HI] 


0.1671 


93% 


Joe Biden [D - DE] 


0.1669 


92% 


Tom Harkin [D - lA] 


0.1570 


94% 


James Jeffords [I - VT] 


0.1269 


88% 


Russ Peingold [D - WI] 


0.1092 


91% 


Bob Graham [D - FL] 


0.1048 


93% 


Fritz Hollings [D - SC] 


0.0727 


88% 


Robert Byrd [D - WV] 


0.0717 


90% 



